* NOTICES *

Japan Patent Office is not responsible for any 
damages caused by the use of this translation. 

1. This document has been translated by computer, so the translation may not reflect the original
precisely.

2. **** shows a word which cannot be translated.
3. In the drawings, any words are not translated.



CLAIMS 



[Claim(s)] 

[Claim 1] Digital signal processing equipment which converts a digital audio signal, characterized
by comprising: pitch detection means for detecting the pitch of the above-mentioned digital audio
signal; class classification means for classifying the class based on the above-mentioned pitch;
and prediction operation means for generating a new digital audio signal, obtained by converting
the above-mentioned digital audio signal, by performing a prediction operation on that signal
with the prediction method corresponding to the classified class.
[Claim 2] The digital signal processing equipment according to claim 1, characterized in that the
above-mentioned prediction operation means uses prediction coefficients generated beforehand by
learning based on a desired digital audio signal.
[Claim 3] A digital signal processing method for converting a digital audio signal, characterized
by comprising: a pitch detection step of detecting the pitch of the above-mentioned digital audio
signal; a class classification step of classifying the class based on the above-mentioned pitch;
and a prediction operation step of generating a new digital audio signal, obtained by converting
the above-mentioned digital audio signal, by performing a prediction operation on that signal
with the prediction method corresponding to the classified class.

[Claim 4] The digital signal processing method according to claim 3, characterized in that the
above-mentioned prediction operation step uses prediction coefficients generated beforehand by
learning based on a desired digital audio signal.
[Claim 5] Learning equipment which generates the prediction coefficients used for the prediction
operation in the conversion processing of digital signal processing equipment which converts a
digital audio signal, characterized by comprising: student digital audio signal generation means
for generating, from a desired digital audio signal, a student digital audio signal in which that
digital audio signal is degraded; pitch detection means for detecting the pitch of the
above-mentioned student digital audio signal; class classification means for classifying the
class based on the above-mentioned pitch; and prediction coefficient calculation means for
computing the prediction coefficients corresponding to the above-mentioned class based on the
above-mentioned digital audio signal and the above-mentioned student digital audio signal.

[Claim 6] A learning method for generating the prediction coefficients used for the prediction
operation in the conversion processing of digital signal processing equipment which converts a
digital audio signal, characterized by comprising: a student digital audio signal generation step
of generating, from a desired digital audio signal, a student digital audio signal in which that
digital audio signal is degraded; a pitch detection step of detecting the pitch of the
above-mentioned student digital audio signal; a class classification step of classifying the
class based on the above-mentioned pitch; and a prediction coefficient calculation step of
computing the prediction coefficients corresponding to the above-mentioned class based on the
above-mentioned digital audio signal and the above-mentioned student digital audio signal.
[Claim 7] A program storage medium which causes digital signal processing equipment to execute a
program comprising: a pitch detection step of detecting the pitch of a digital audio signal; a
class classification step of classifying the class based on the above-mentioned pitch; and a
prediction operation step of generating a new digital audio signal, obtained by converting the
above-mentioned digital audio signal, by performing a prediction operation on that signal with
the prediction method corresponding to the classified class.
[Claim 8] A program storage medium which causes learning equipment to execute a program
comprising: a student digital audio signal generation step of generating, from a desired digital
audio signal, a student digital audio signal in which that digital audio signal is degraded; a
pitch detection step of detecting the pitch of the above-mentioned student digital audio signal;
a class classification step of classifying the class based on the above-mentioned pitch; and a
prediction coefficient calculation step of computing the prediction coefficients corresponding to
the above-mentioned class based on the above-mentioned digital audio signal and the
above-mentioned student digital audio signal.



[Translation done.] 



http://www4.ipdljpo.goj 2004/08/27 






DETAILED DESCRIPTION 



[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] This invention relates to a digital signal processing method, a learning
method, apparatuses therefor, and a program storage medium. It is suitable for application to a
digital signal processing method, a learning method, apparatuses therefor, and a program storage
medium that perform data interpolation processing on a digital signal in a rate converter, a PCM
(Pulse Code Modulation) decoding device, or the like.
[0002] 

[Description of the Prior Art] Conventionally, before a digital audio signal is input to a
digital/analog converter, oversampling processing is performed that converts the sampling
frequency to several times its original value. As a result, for the digital audio signal output
from the digital/analog converter, the phase characteristic of the analog anti-aliasing filter is
kept constant in the audible high-frequency range, and the effect of the image noise of the
digital system that accompanies sampling is eliminated.

[0003] In this oversampling processing, a digital filter of the first-order linear (straight
line) interpolation type is usually used. When the sampling rate changes or data are missing,
such a digital filter generates linearly interpolated data by taking the average of two or more
existing data.
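
The conventional straight-line interpolation described above can be sketched as follows. This is
only an illustration: the function name `oversample_linear` and the `factor` parameter are
hypothetical and do not come from the source, which names no concrete filter implementation.

```python
def oversample_linear(samples, factor=2):
    """Insert (factor - 1) linearly interpolated points between neighbouring samples."""
    if factor < 1:
        raise ValueError("factor must be >= 1")
    if not samples:
        return []
    out = []
    for a, b in zip(samples, samples[1:]):
        for step in range(factor):
            # Straight-line blend between the two existing samples a and b.
            out.append(a + (b - a) * step / factor)
    out.append(samples[-1])  # the final original sample has no right-hand neighbour
    return out


doubled = oversample_linear([0, 2, 4], factor=2)
```

With `factor=2` every inserted point is simply the average of its two neighbours, which is why
this kind of filter adds samples without adding any new frequency content.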
[0004] 

[Problem(s) to be Solved by the Invention] However, although first-order linear interpolation
makes the data of the digital audio signal after oversampling several times finer in the
time-axis direction, the frequency band of the signal is almost unchanged from before the
conversion, and the sound quality itself is not improved. Furthermore, because the interpolated
data are not necessarily generated based on the waveform of the analog audio signal before A/D
conversion, waveform reproducibility is hardly improved either.

[0005] Moreover, when digital audio signals with different sampling frequencies are dubbed, the
frequency is converted with a sampling rate converter. In this case too, the first-order linear
digital filter can perform only linear interpolation of the data, and it is difficult to improve
sound quality and waveform reproducibility. The same applies when data samples of a digital audio
signal are missing.

[0006] This invention was made in consideration of the above points, and proposes a digital
signal processing method, a learning method, apparatuses therefor, and a program storage medium
that can further improve the waveform reproducibility of a digital signal.
[0007] 

[Means for Solving the Problem] To solve this problem, this invention detects the pitch of a
digital audio signal, classifies the class based on the detected pitch, and converts the digital
audio signal by the prediction method corresponding to the classified class, so that conversion
better adapted to the characteristics of the digital audio signal can be performed.
[0008] 
[Embodiment of the Invention] One embodiment of this invention is explained in detail below with
reference to the drawings.

[0009] In drawing 1, when the audio signal processor 10 raises the sampling rate of a digital
audio signal (hereinafter called audio data) or interpolates audio data, it generates audio data
close to the true value by class classification adaptive processing. Incidentally, a digital
audio signal here means a voice signal representing the voice uttered by a person or an animal, a
musical-sound signal representing the sound emitted by a musical instrument, or a signal
representing any other sound.

[0010] That is, in the audio signal processor 10, the pitch detecting element 11 detects the pitch
(fundamental frequency) of the input audio data D10 supplied from the input terminal TIN, shown
in drawing 2 or drawing 3, based on the autocorrelation of the input audio data D10.
[0011] Specifically, the pitch detecting element 11 computes an autocorrelation value for each of
various ranges obtained by dividing the data along the time axis, and judges that the input audio
data D10 have a pitch when the autocorrelation value is larger than a predetermined threshold.
When there is a pitch, the pitch detecting element 11 takes the pitch period PITT as the pitch
size, and takes the pitch size and its location as the pitch class. For example, when the input
audio data D10 have the waveform shown in drawing 2, the pitch detecting element 11 detects,
relative to the current pitch PIT 2, the front pitch PIT 1 and the back pitch PIT 3 at the
locations adjoining it before and after. Incidentally, the front pitch PIT 1 and the back pitch
PIT 3 detected by the pitch detecting element 11 need not adjoin the current pitch PIT 2, and may
be separated from the current pitch PIT 2 by a predetermined distance.
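
A minimal sketch of threshold-based pitch detection by autocorrelation, in the spirit of
paragraphs [0010] and [0011], is given here. The function names, the normalisation by zero-lag
energy, the lag search range, and the threshold of 0.5 are all assumptions made for illustration;
the source specifies none of them, and the sketch does not attempt to locate the front and back
pitches.

```python
import math


def normalized_autocorr(x, lag):
    """Autocorrelation of x at the given lag, normalised by the zero-lag energy."""
    energy = sum(v * v for v in x)
    if energy == 0:
        return 0.0
    return sum(x[i] * x[i + lag] for i in range(len(x) - lag)) / energy


def detect_pitch(x, min_lag, max_lag, threshold=0.5):
    """Return the lag (in samples) with the largest autocorrelation value, or
    None when no lag exceeds the threshold (treated here as 'no pitch')."""
    best_lag, best_val = None, threshold
    for lag in range(min_lag, min(max_lag, len(x) - 1) + 1):
        val = normalized_autocorr(x, lag)
        if val > best_val:
            best_lag, best_val = lag, val
    return best_lag


# Usage: a sine wave with a period of 20 samples.
sine = [math.sin(2 * math.pi * i / 20) for i in range(200)]
period = detect_pitch(sine, min_lag=10, max_lag=30)
```

Returning `None` mirrors the "no pitch" branch of the processing, in which taps are taken from
the waveform near the current data instead of from neighbouring pitch periods.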
[0012] When the front pitch PIT 1 and the back pitch PIT 3 corresponding to the current pitch
PIT 2 are detected, the pitch detecting element 11 supplies the extraction control signal CONT11
for cutting them out to the variable class classification extraction section 12 and the variable
prediction operation extraction section 13.

[0013] Based on the extraction control signal CONT11, the variable class classification
extraction section 12 cuts out from the input audio data D10 the class tap CL 1 corresponding to
the front pitch PIT 1, the class tap CL 2 corresponding to the current pitch PIT 2, and the class
tap CL 3 corresponding to the back pitch PIT 3, and supplies them to the class classification
section 14 as the class tap data D12.

[0014] The class classification section 14 has an ADRC (Adaptive Dynamic Range Coding) circuit
section and a class code generating circuit section. The ADRC circuit section takes the class
taps cut out by the variable class classification extraction section 12, either individually for
each pitch (the front pitch PIT 1, the current pitch PIT 2, and the back pitch PIT 3) or combined
together, and compresses them to generate a compressed data pattern. The class code generating
circuit section generates the class code to which the class tap data D12 belong.

[0015] The ADRC circuit section forms pattern compressed data by performing an operation that
compresses the class tap data D12 from 8 bits to 2 bits. Because this ADRC circuit section
performs adaptive quantization and can efficiently express the local pattern of the signal level
with a short word length, it is used to generate the code for class classification of the signal
pattern.

[0016] Specifically, if six 8-bit data on an audio waveform (class taps) were classified
directly, they would have to be sorted into an enormous number of classes, 2^48, and the burden
on the circuit would increase. In the class classification section 14 of this embodiment, class
classification is therefore performed based on the pattern compressed data generated by the ADRC
circuit section provided inside it. For example, if 1-bit quantization is applied to six class
taps, the six class taps can be expressed with 6 bits and classified into 2^6 = 64 classes.
[0017] Here, letting DR be the dynamic range of the class taps in the cut-out region (audio
waveform), m the bit allocation, L the data level of each class tap, and Q the quantization code,
the ADRC circuit section follows the next equation:
[0018]
[Equation 1]
$$DR = MAX - MIN + 1$$
$$Q = \{ (L - MIN + 0.5) \times 2^{m} / DR \} \qquad (1)$$
[0019] and quantizes by dividing the interval between the maximum value MAX and the minimum value
MIN in the region equally by the specified bit length. In equation (1), { } means truncation of
the digits after the decimal point. Thus, supposing for example that each of the six class taps
consists of 8 bits (m = 8), each of them is compressed into 2 bits in the ADRC circuit section.
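
The quantization of equation (1) can be sketched in Python as follows. The function name
`adrc_quantize` and its `bits` parameter are illustrative and not taken from the source. The
sketch assumes that the exponent in equation (1) is the number of bits of the output code (2 in
the embodiment), so that every code falls in the range 0 to 2^bits - 1; the parenthetical m = 8
in the text is read here as the word length of the input taps.

```python
def adrc_quantize(taps, bits=2):
    """Re-quantise each tap level to `bits` bits relative to the local dynamic range."""
    lo, hi = min(taps), max(taps)
    dr = hi - lo + 1  # DR = MAX - MIN + 1
    codes = []
    for level in taps:
        # Q = {(L - MIN + 0.5) * 2^bits / DR}; int() truncates the fraction,
        # which is a floor here because the value is never negative.
        codes.append(int((level - lo + 0.5) * (1 << bits) / dr))
    return codes


# Usage: six 8-bit class taps compressed to 2 bits each.
codes = adrc_quantize([10, 50, 90, 130, 170, 210], bits=2)
```

Because the code depends only on where each level sits between MIN and MAX, two tap blocks with
the same shape but different loudness map to the same pattern.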

[0020] If the audio data points (class taps) compressed in this way are denoted qn (n = 1 to 6),
the class code generating circuit section provided in the class classification section 14
performs, based on the compressed values qn, the operation shown in the next equation:
[0021]
[Equation 2]
$$class = \sum_{i=1}^{n} q_i (2^{P})^{i} \qquad (2)$$
[0022] and thereby computes the class code class that indicates the class to which the block (q1
to q6) belongs, and supplies the computed class code to the prediction coefficient memory 15 as
the class code data D14. This class code class gives the read-out address used when a prediction
coefficient is read from the prediction coefficient memory 15. Incidentally, in equation (2), n
is the number of compressed audio data points (class taps) qn, which is n = 6 in this
embodiment, and P is the bit allocation, which is P = 2 in this embodiment.
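
As a hedged illustration, the class code of equation (2) can be computed as below. The exponent
is taken to be the tap index i, which is an assumption because the printed superscript is not
legible in this translation; the function name `class_code` is likewise hypothetical.

```python
def class_code(codes, p=2):
    """class = sum over i = 1..n of q_i * (2^p)^i, where q_i are the ADRC codes.

    Assumption: the exponent is the 1-based tap index i.
    """
    base = 1 << p  # 2^P
    return sum(q * base ** i for i, q in enumerate(codes, start=1))


# Usage: six 2-bit codes packed into a single address-like integer.
code = class_code([0, 0, 1, 2, 3, 3], p=2)
```

Each tap occupies its own group of P bits, so different code patterns always give different
class codes, which is what lets the code serve as a read-out address.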

[0023] In this way, the class classification section 14 generates the class code class of the
class taps cut out from the input audio data D10 by the variable class classification extraction
section 12, and supplies it to the prediction coefficient memory 15 as the class code data D14.

[0024] The prediction coefficient memory 15 stores the set of prediction coefficients
corresponding to each class code at the address corresponding to that class code. Based on the
class code data D14 supplied from the class classification section 14, the set of prediction
coefficients w1 to wn stored at the address corresponding to the class code is read out and
supplied to the prediction operation part 16.
[0025] The prediction operation part 16 applies the prediction coefficients w1 to wn to the audio
data points (prediction taps) D13 (x1 to xn) to be used in the prediction operation, which are
cut out by the variable prediction operation extraction section 13 according to the extraction
control signal CONT11 supplied from the pitch detecting element 11, and performs the
sum-of-products operation shown in the next equation:
[0026]
[Equation 3]
$$y' = w_1 x_1 + w_2 x_2 + \cdots + w_n x_n \qquad (3)$$
[0027] to obtain the prediction result y'. This predicted value y' is output from the prediction
operation part 16 as the audio data D16 with improved sound quality.
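
The look-up and sum-of-products operation of equation (3) might look like the following sketch.
The dictionary standing in for the prediction coefficient memory, its two example entries, and
the function names are all hypothetical; real coefficient values would come from learning.

```python
def predict(prediction_taps, coefficients):
    """y' = w_1*x_1 + w_2*x_2 + ... + w_n*x_n (sum of products of equation (3))."""
    if len(prediction_taps) != len(coefficients):
        raise ValueError("tap and coefficient counts must match")
    return sum(w * x for w, x in zip(coefficients, prediction_taps))


# Hypothetical coefficient memory: class code -> set of prediction coefficients.
coefficient_memory = {
    0: [0.5, 0.5],
    4: [0.25, 0.75],
}


def predict_for_class(code, prediction_taps, memory=coefficient_memory):
    """Read the coefficient set addressed by the class code, then predict."""
    return predict(prediction_taps, memory[code])


y_pred = predict_for_class(4, [10, 20])
```

The only class-dependent part is which coefficient set is read; the arithmetic itself is the
same linear combination for every class.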

[0028] Incidentally, when the pitch detecting element 11 judges that the input audio data D10 have
no pitch, it does not use the front pitch PIT 1 and the back pitch PIT 3. Instead, as shown in
drawing 3, it supplies the variable class classification extraction section 12 and the variable
prediction operation extraction section 13 with the control signal CONT11 for cutting out the
levels of the audio waveform near the current data, so that the class classification and the
prediction operation are performed on the audio waveform near the current data.
[0029] Although the configuration of the audio signal processor 10 has been described with the
functional blocks of drawing 1, in this embodiment the computer-configured equipment shown in
drawing 4 is used as the concrete configuration that realizes these functional blocks. That is,
in drawing 4, the audio signal processor 10 has a configuration in which the CPU 21, the ROM
(Read Only Memory) 22, the RAM (Random Access Memory) 15 constituting the prediction coefficient
memory 15, and each circuit section are connected to one another through the bus BUS. By
executing the various programs stored in the ROM 22, the CPU 21 operates as each of the
functional blocks described above with reference to drawing 1 (the pitch detecting element 11,
the variable class classification extraction section 12, the variable prediction operation
extraction section 13, the class classification section 14, and the prediction operation part
16).
[0030] The audio signal processor 10 also has a communication interface 24 for communicating with
a network, and a removable drive 28 for reading information from external storage media such as
a floppy disk or a magneto-optical disk. Each program for performing the class classification
adaptive processing described above with reference to drawing 1 can be read, via the network or
from an external storage medium, onto the hard disk of the hard disk drive unit 25, and the class
classification adaptive processing can also be performed according to the program thus read.

[0031] By entering various commands through the input means 26, such as a keyboard and a mouse, a
user causes the CPU 21 to execute the class classification processing described above with
reference to drawing 1. In this case, the audio signal processor 10 receives the audio data whose
sound quality is to be improved (input audio data) D10 through the data I/O section 27, applies
the class classification adaptive processing to the input audio data D10, and then outputs the
audio data D16 with improved sound quality to the outside through the data I/O section 27.

[0032] Incidentally, drawing 5 shows the procedure of the class classification adaptive
processing in the audio signal processor 10. The audio signal processor 10 enters the procedure
at step SP 11 and, in the following step SP 12, computes the pitch of the input audio data D10 in
the pitch detecting element 11.

[0033] The computed pitches (the front pitch PIT 1, the current pitch PIT 2, and the back pitch
PIT 3) serve to make the class classification of the audio waveform more reliable. After cutting
out the class taps according to the pitch in step SP 13, the audio signal processor 10 classifies
the class taps (audio waveform) with the class classification section 14 in step SP 14. The audio
signal processor 10 then reads a prediction coefficient from the prediction coefficient memory 15
using the class code obtained as a result of the class classification. The prediction
coefficients are obtained beforehand by learning and stored for each class, so by reading the
prediction coefficient corresponding to the class code, the audio signal processor 10 can use the
prediction coefficient that matches the characteristics of the audio waveform at that time.
[0034] The prediction coefficient read from the prediction coefficient memory 15 is used for the
prediction operation of the prediction operation part 16 in step SP 15. The input audio data D10
are thereby subjected to a prediction operation that uses the class code obtained from the class
taps cut out according to the pitch, and are converted into the desired audio data D16. In this
way the input audio data D10 are converted into the audio data D16 with improved sound quality,
and the audio signal processor 10 moves to step SP 16 and ends the procedure.

[0035] Next, the learning circuit that obtains beforehand, by learning, the set of prediction
coefficients for each class stored in the prediction coefficient memory 15 described above with
reference to drawing 1 is explained.

[0036] In drawing 6, the learning circuit 30 receives the high-sound-quality teacher audio data
D30 at the student signal generation filter 37. The student signal generation filter 37 thins out
the teacher audio data D30 by a predetermined number of samples at every predetermined interval,
at the thinning rate set by the thinning rate setting signal D39.

[0037] In this case, the prediction coefficients generated differ depending on the thinning rate
in the student signal generation filter 37, and the audio data reproduced by the above-mentioned
audio signal processor 10 differ accordingly. For example, when the sound quality of audio data
is to be improved by raising the sampling frequency in the audio signal processor 10, the student
signal generation filter 37 performs thinning processing that lowers the sampling frequency.
Likewise, when the sound quality is to be improved in the audio signal processor 10 by filling in
data samples missing from the input audio data D10, the student signal generation filter 37
performs thinning processing that drops data samples.
[0038] In this way, the student signal generation filter 37 generates the student audio data D37
from the teacher audio data D30 by the predetermined thinning processing, and supplies them to
the pitch detecting element 31, the variable class classification extraction section 32, and the
variable prediction operation extraction section 33, respectively.
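
The two ways of degrading the teacher data described above, lowering the sampling frequency and
dropping data samples, can be sketched as follows. The function names, the `rate`, `start`,
`count`, and `fill` parameters, and the use of `None` to mark a missing sample are assumptions for
illustration; the source does not describe the filter at this level of detail, and no
anti-aliasing filtering is modelled.

```python
def thin_by_rate(teacher, rate):
    """Keep every `rate`-th sample, lowering the sampling frequency by that factor."""
    if rate < 1:
        raise ValueError("rate must be >= 1")
    return teacher[::rate]


def drop_samples(teacher, start, count, fill=None):
    """Mark `count` samples from index `start` as missing (replaced by `fill`)."""
    student = list(teacher)
    for i in range(start, min(start + count, len(student))):
        student[i] = fill
    return student


teacher = [0, 1, 2, 3, 4, 5, 6]
student_low_rate = thin_by_rate(teacher, rate=2)
student_with_gap = drop_samples(teacher, start=2, count=2)
```

Matching the degradation used during learning to the conversion wanted at run time is what ties
a given coefficient set to a given task, such as doubling the sampling rate or filling gaps.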

[0039] For the student audio data D37 supplied from the student signal generation filter 37, the
pitch detecting element 31 detects the pitch (fundamental frequency) described above with
reference to drawing 2 and drawing 3, based on the autocorrelation of the student audio data D37.
[0040] When the front pitch PIT 1 and the back pitch PIT 3 corresponding to the current pitch
PIT 2 are detected, the pitch detecting element 31 supplies the extraction control signal CONT31
for cutting them out to the variable class classification extraction section 32 and the variable
prediction operation extraction section 33.

[0041] Based on the extraction control signal CONT31, the variable class classification
extraction section 32 cuts out from the student audio data D37 the class tap CL 1 corresponding
to the front pitch PIT 1, the class tap CL 2 corresponding to the current pitch PIT 2, and the
class tap CL 3 corresponding to the back pitch PIT 3, and supplies them to the class
classification section 34 as the class tap data D32.

[0042] The class classification section 34 has an ADRC (Adaptive Dynamic Range Coding) circuit
section, which compresses the class taps cut out by the variable class classification extraction
section 32 and generates a compressed data pattern, and a class code generating circuit section,
which generates the class code to which the class tap data D32 belong.

[0043] The ADRC circuit section forms pattern compressed data by performing an operation that
compresses the class tap data D32 from 8 bits to 2 bits. Because this ADRC circuit section
performs adaptive quantization and can efficiently express the local pattern of the signal level
with a short word length, it is used to generate the code for class classification of the signal
pattern.

[0044] Specifically, if six 8-bit data on an envelope waveform (class taps) were classified
directly, they would have to be sorted into an enormous number of classes, 2^48, and the burden
on the circuit would increase. In the class classification section 34 of this embodiment, class
classification is therefore performed based on the pattern compressed data generated by the ADRC
circuit section provided inside it. For example, if 1-bit quantization is applied to six class
taps, the six class taps can be expressed with 6 bits and classified into 2^6 = 64 classes.
[0045] Here, letting DR be the dynamic range of the class taps in the cut-out region (audio
waveform), m the bit allocation, L the data level of each class tap, and Q the quantization code,
the ADRC circuit section quantizes by the same operation as equation (1) above, dividing the
interval between the maximum value MAX and the minimum value MIN in the region equally by the
specified bit length. Thus, supposing for example that each of the six data points on the audio
waveform consists of 8 bits (m = 8), each of them is compressed into 2 bits in the ADRC circuit
section.

[0046] If the audio data points (class taps) compressed in this way are denoted qn (n = 1 to 6),
the class code generating circuit section provided in the class classification section 34
performs, based on the compressed values qn, the same operation as equation (2) above, and
thereby computes the class code class that indicates the class to which the block (q1 to q6)
belongs. It supplies the computed class code to the prediction coefficient calculation section 36
as the class code data D34. Incidentally, in equation (2), n is the number of compressed audio
data points (class taps) qn, which is n = 6 in this embodiment, and P is the bit allocation,
which is P = 2 in this embodiment.

[0047] In this way, the class classification section 34 generates the class code data D34 and
supplies them to the prediction coefficient calculation section 36. In addition, the audio data
points (prediction taps) D33 (x1 to xn) to be used in the prediction operation, which are cut out
by the variable prediction operation extraction section 33 according to the extraction control
signal CONT31 supplied from the pitch detecting element 31, are supplied to the prediction
coefficient calculation section 36.
[0048] For each class code class, the prediction coefficient calculation section 36 sets up a
normal equation using the class code class supplied from the class classification section 34, the
prediction taps D33 cut out for that class code, and the high-sound-quality teacher audio data
D30 supplied from the input terminal TIN.

[0049] That is, let the levels of n samples of the student audio data D37 be x1, x2, ..., xn, and
let the quantized data obtained by applying p-bit ADRC to each of them be q1, ..., qn. The class
code class of this region is then defined as in equation (2) above. With the levels of the
student audio data D37 denoted x1, x2, ..., xn as above and the level of the high-sound-quality
teacher audio data D30 denoted y, an n-tap linear estimation equation using the prediction
coefficients w1, w2, ..., wn is set up for each class code. This is the next equation:
[0050]
[Equation 4]
$$y = w_1 x_1 + w_2 x_2 + \cdots + w_n x_n \qquad (4)$$
[0051] Before learning, each wn is an undetermined coefficient.

[0052] In the learning circuit 30, learning is performed on a plurality of audio data for each
class code. When the number of data samples is M, the next equation is set up in accordance with
equation (4) above:
[0053]
[Equation 5]
$$y_k = w_1 x_{k1} + w_2 x_{k2} + \cdots + w_n x_{kn} \qquad (5)$$
[0054] where k = 1, 2, ..., M.

[0055] When M > n, the prediction coefficients w1, ..., wn are not determined uniquely, so the
elements of the error vector e are defined by the next equation:
[0056]
[Equation 6]
$$e_k = y_k - \{ w_1 x_{k1} + w_2 x_{k2} + \cdots + w_n x_{kn} \} \qquad (6)$$
[0057] (where k = 1, 2, ..., M), and the prediction coefficients that minimize the next equation
are sought:
[0058]
[Equation 7]
$$e^2 = \sum_{k=1}^{M} e_k^2 \qquad (7)$$
[0059] This is the solution by the so-called least squares method.

[0060] Here, the partial derivative of equation (7) with respect to each wn is taken. In this
case, the next equation holds:
[0061]
[Equation 8]
$$\frac{\partial e^2}{\partial w_i} = \sum_{k=1}^{M} 2 \left( \frac{\partial e_k}{\partial w_i} \right) e_k = -\sum_{k=1}^{M} 2 x_{ki} \cdot e_k \quad (i = 1, 2, \ldots, n) \qquad (8)$$
[0062] Each wn (n = 1 to 6) need only be determined so that this becomes 0.
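[0063] Then, Xij and Yi are defined by the next equations:
[0064]
[Equation 9]
$$X_{ij} = \sum_{p=1}^{M} x_{pi} \cdot x_{pj} \qquad (9)$$
[0065]
[Equation 10]
$$Y_i = \sum_{k=1}^{M} x_{ki} \cdot y_k \qquad (10)$$
[0066] With Xij and Yi defined in this way, equation (8) is written using matrices as the next
equation:
[0067]
[Equation 11]
$$\begin{pmatrix} X_{11} & X_{12} & \cdots & X_{1n} \\ X_{21} & X_{22} & \cdots & X_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ X_{n1} & X_{n2} & \cdots & X_{nn} \end{pmatrix} \begin{pmatrix} w_1 \\ w_2 \\ \vdots \\ w_n \end{pmatrix} = \begin{pmatrix} Y_1 \\ Y_2 \\ \vdots \\ Y_n \end{pmatrix} \qquad (11)$$
[0068] It is expressed in this way.

[0069] This equation is generally called the normal equation. Here, n = 6.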
[0070] After all the learning data (the teacher audio data D30, the class code class, and the
prediction taps D33) have been input, the prediction coefficient calculation section 36 sets up
the normal equation shown in equation (11) above for each class code class, solves it for each wn
using a general matrix solution method such as the sweep-out method, and thereby computes the
prediction coefficients for each class code. The prediction coefficient calculation section 36
writes each computed prediction coefficient (D36) into the prediction coefficient memory 15.
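
A compact sketch of this per-class learning step follows. It accumulates the sums of equations
(9) and (10) for each class code and solves the normal equation (11) by Gauss-Jordan elimination,
which is taken here to be the matrix solution method the text refers to (an assumption). The
function names, the sample tuple layout, and the singularity tolerance are hypothetical.

```python
from collections import defaultdict


def solve_sweep_out(a, b):
    """Solve a * w = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix [a | b]; copied so the caller's data stay untouched.
    m = [list(map(float, a[r])) + [float(b[r])] for r in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("normal equation is singular for this class")
        m[col], m[pivot] = m[pivot], m[col]
        pv = m[col][col]
        m[col] = [v / pv for v in m[col]]
        for r in range(n):
            if r != col:
                f = m[r][col]
                m[r] = [vr - f * vc for vr, vc in zip(m[r], m[col])]
    return [m[r][n] for r in range(n)]


def learn_coefficients(samples, n):
    """samples: iterable of (class_code, prediction_taps, teacher_level)."""
    X = defaultdict(lambda: [[0.0] * n for _ in range(n)])
    Y = defaultdict(lambda: [0.0] * n)
    for code, x, y in samples:
        for i in range(n):
            Y[code][i] += x[i] * y            # Y_i  = sum_k x_ki * y_k
            for j in range(n):
                X[code][i][j] += x[i] * x[j]  # X_ij = sum_k x_ki * x_kj
    return {code: solve_sweep_out(X[code], Y[code]) for code in X}


# Usage: four samples of one class generated from y = 0.25*x1 + 0.75*x2.
data = [(5, [1, 0], 0.25), (5, [0, 1], 0.75), (5, [1, 1], 1.0), (5, [2, 1], 1.25)]
memory = learn_coefficients(data, n=2)
```

A class with too few or linearly dependent samples yields a singular matrix, so the sketch
raises an error there rather than returning arbitrary coefficients.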

[0071] As a result of this learning, the prediction coefficient memory 15 stores, for each class
code, the prediction coefficients for estimating the high-sound-quality audio data y for each
pattern specified by the quantized data q1, ..., q6. This prediction coefficient memory 15 is
used in the audio signal processor 10 described above with reference to drawing 1. This
processing completes the learning of the prediction coefficients for creating high-sound-quality
audio data from ordinary audio data according to the linear estimation equation.

[0072] Thus, by thinning out the high-quality teacher audio data with the student signal
generation filter 37 in consideration of the degree of interpolation processing performed in the
audio signal processor 10, the study circuit 30 can generate the prediction coefficients for the
interpolation processing in the audio signal processor 10.
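
The generation of student data from teacher data can be pictured with the following minimal
sketch, which simply keeps every `factor`-th sample. The function name and the `factor`
parameter are hypothetical, and no low-pass filtering is applied; the internal details of the
student signal generation filter 37 are not given in this passage.

```python
def make_student_signal(teacher, factor):
    """Keep every `factor`-th sample of the teacher data (plain sample thinning)."""
    if factor < 1:
        raise ValueError("factor must be at least 1")
    return teacher[::factor]
```

With `factor` chosen to match the interpolation ratio used at conversion time, the learned
coefficients correspond to the amount of interpolation the processor has to perform.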
[0073] In the above configuration, the audio signal processor 10 detects the pitch of the input
audio data D10 in the pitch detecting element 11, and can thereby extract the waveform portion of
the input audio data D10 that has particular regularity.

[0074] This regular waveform portion (the pitch) differs from individual to individual even for
the same phoneme in the human voice. When this portion is cut out accurately and class
classification is carried out, the features of the audio waveform that differ for each individual
are divided into classes far more clearly.

[0075] Thus, by clearly dividing the features of the audio waveform into classes and performing
the prediction operation based on the class classification result, the reproducibility of the
audio waveform improves and the sound quality is improved.
[0076] According to the above configuration, the cut-out range of the taps is controlled based on
the pitch detection result, so that the input audio data D10 can be converted into audio data D16
of even higher sound quality.

[0077] In the embodiment described above, the case was described where, in the audio signal
processor 10 and the study circuit 30, the pitch size detected by the pitch detecting elements 11
and 31 and its position are supplied to the adjustable class classification extraction sections
12 and 32 as the pitch class. However, the present invention is not limited to this. As shown in
drawing 7 and drawing 8, in which parts corresponding to those in drawing 1 and drawing 6 are
given the same reference signs, the number of pitches detected by pitch detecting elements 11'
and 31' may be supplied to class classification sections 14' and 34' as pitch class data D11 and
D31.

[0078] In this case, class classification sections 14' and 34' associate the class code class of
the class tap data D12 and D32 supplied from the adjustable class classification extraction
section 12 with the class code CLASS of the pitch class data D11 and D31 for the same time
domain, and generate the unified class code data (class') D14 and D34.

[0079] Thus, if the number of pitches detected by pitch detecting elements 11' and 31' is
supplied to class classification sections 14' and 34' as pitch class data D11 and D31, the number
of class divisions can be increased further, and the audio signal processor 10, which performs
the prediction operation on the input audio data using the prediction coefficients based on the
resulting class classification, can convert the data into audio data of even higher sound
quality.
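
One simple way to unify two class codes into a single code, as paragraph [0078] describes for
class', is to treat them as digits of a mixed-radix number. The sketch below does this; the
packing scheme, the function name, and the parameter `num_waveform_classes` are assumptions for
illustration only, since the specification does not state how the two codes are combined.

```python
def unify_class_codes(waveform_class, pitch_class, num_waveform_classes):
    """Pack a waveform class code and a pitch class code into one integer code."""
    if not 0 <= waveform_class < num_waveform_classes:
        raise ValueError("waveform_class out of range")
    if pitch_class < 0:
        raise ValueError("pitch_class must be non-negative")
    # Each distinct (pitch_class, waveform_class) pair maps to a distinct integer.
    return pitch_class * num_waveform_classes + waveform_class
```

Under this assumed scheme the total number of classes is the product of the two class counts,
which is one way the class divisions become more numerous.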
[0080] Moreover, although the embodiment described above uses autocorrelation as the method of
detecting the pitch of an audio waveform, the present invention is not limited to this, and
various other methods can be applied, such as cepstrum analysis, which applies a Fourier
transform to the audio waveform, or pattern recognition.
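
As a rough illustration of pitch detection by autocorrelation, the following sketch searches a
range of lags for the strongest autocorrelation and reports that lag together with a normalized
strength. The function names, the lag range parameters, and the normalization by the zero-lag
energy are assumptions for this example; the actual procedure of the pitch detecting element 11
is not reproduced here.

```python
import math


def autocorrelation(signal, lag):
    """Unnormalized autocorrelation of the signal at the given lag."""
    return sum(signal[i] * signal[i + lag] for i in range(len(signal) - lag))


def detect_pitch_period(signal, min_lag, max_lag):
    """Return (lag, strength) for the lag with the largest autocorrelation."""
    energy = autocorrelation(signal, 0)
    if energy == 0:
        return None, 0.0                      # silent input: no pitch found
    best_lag = max(range(min_lag, max_lag + 1),
                   key=lambda lag: autocorrelation(signal, lag))
    return best_lag, autocorrelation(signal, best_lag) / energy


# Test signal: a sine wave with a period of 20 samples.
signal = [math.sin(2 * math.pi * i / 20) for i in range(200)]
period, strength = detect_pitch_period(signal, 5, 50)
```

A strength close to 1 indicates a strongly periodic waveform, while a low value suggests that
the input has no clear pitch.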

[0081] Moreover, although the embodiment described above performs pitch detection and class
classification on the time-domain waveform of the input audio data D10, the present invention is
not limited to this; the data may be transformed into the frequency domain and pitch detection
and class classification performed there.
[0082] Moreover, although the embodiment described above makes the cut-out size of the taps for
class classification the same as the cut-out region of the taps for the prediction operation,
the present invention is not limited to this; the cut-out size of the taps for class
classification may differ from that of the taps for the prediction operation. Further, the
cut-out size may be changed according to the strength of the autocorrelation detected in the
pitch detecting element 11. In this case, the cut-out size can be made larger as the
autocorrelation becomes stronger.
[0083] Moreover, although the embodiment described above uses a first-order linear method as the
prediction method, the present invention is not limited to this. In short, it is sufficient to
use the result of learning, and various prediction methods, such as a method using a
higher-order function, are applicable.
[0084] Moreover, although the embodiment described above generates the compressed data pattern
by ADRC in the class classification section 14, the present invention is not limited to this,
and a compression means such as lossless coding (DPCM: Differential Pulse Code Modulation) or
vector quantization (VQ: Vector Quantization) may be used.
[0085] Moreover, although the embodiment described above thins out predetermined samples from
the teacher audio data D30 in the student signal generation filter 37 of the study circuit 30,
the present invention is not limited to this, and various other methods can be applied, such as
reducing the number of bits.
[0086] 

[Effect of the Invention] As described above, according to the present invention, the pitch of a
digital audio signal is detected, its class is classified based on the detected pitch, and the
digital audio signal is converted by a prediction method corresponding to the classified class,
so that conversion even better adapted to the features of the digital audio signal can be
performed.
DESCRIPTION OF DRAWINGS 



[Brief Description of the Drawings] 

[Drawing 1] A block diagram showing the configuration of the digital-signal-processing equipment
according to the present invention.

[Drawing 2] A signal waveform diagram used to explain the pitch of a digital audio signal.

[Drawing 3] A signal waveform diagram used to explain the processing of a digital audio signal
without a pitch.

[Drawing 4] A block diagram showing the configuration of an audio signal processor.
[Drawing 5] A flow chart showing the audio signal conversion processing procedure.
[Drawing 6] A block diagram showing the configuration of the study equipment according to the
present invention.

[Drawing 7] A block diagram showing the configuration of the digital-signal-processing equipment
according to another embodiment.

[Drawing 8] A block diagram showing the configuration of the study equipment according to
another embodiment.
[Description of Notations] 

10 .... Audio signal processor, 11, 31 .... Pitch detecting element, 14, 34 .... Class
classification section, 15 .... Prediction coefficient memory, 16 .... Prediction operation part,
36 .... Prediction coefficient calculation section, 37 .... Student signal generation filter.